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Abstract 

Background: Rabies virus (RV), the agent of rabies, can cause a severe encephalomyelitis in several species of 
mammals, including humans. As a human rabies vaccine strain employed in China, the genetic knowledge of the 
aG strain has not been fully studied. The main goal of the present study is to amplify the whole genome of aG 
strain, and genetic relationships between other vaccine strains and wild strains were analyzed. 

Results: The entire genome of human rabies virus vaccine strain aG employed in China was sequenced; this is the 
second rabies virus vaccine strain from China to be fully characterized. The overall organization and the length of 
the genome were similar to that of other lyssaviruses. The length of aG strain was 1 1925nt, comprising a leader 
sequence of 58nt, nucleoprotein (N) gene of 1353nt, phosphoprotein (P) gene of 894 nt, matrix protein (M) gene 
of 609nt, glycoprotein (G) gene of 1575nt, RNA-dependent RNA polymerase (RdRp,L) gene of 6384nt, and a trailer 
region of 70 nt. There was TGAAAAAAA (TGA 7 ) consensus sequence in the end of each gene, except AGA 7 at the 
end of G gene. There was AACAYYYCT consensus start signal at the beginning of each gene. 

Conclusions: In this report, we analyzed the full genome of China human rabies vaccine strain aG. Our studies 
indicated that the genome of aG retained the basic characteristics of RV. At gene level, N was the most conserved 
among the five coding genes, indicating this gene is the most appropriate for quantitative genotype definition. 
The phylogenetic analysis of the N indicated the aG strain clustered most closely with Japanese and Russian rabies 
vaccine strains, suggesting that they may share the same ancestor; also, the aG strain did not share high homology 
with wild strains isolated from China, making it may not be the best vaccine strain, more research is needed to 
elucidate the genetic relationship among the RV circulating in China. 

Keywords: Rabies virus, aG, China, full-length genome 



1. Background which encodes the nucleocapsid protein (N), phospho- 

Rabies is a widespread neurological zoometric disease protein (P), matrix protein (M), glycoprotein (G) and 

which affects almost all kinds of mammals, including RNA-dependent RNA polymerase (large protein), 

humans. The morality is almost 100%. It is reported Between the monocistronic there are four intergenic 

that nearly 55,000 human die of rabies all over the regions (IGRS) with different length [3]. Besides, at the 

world each year, most occurred in Asia and Africa [1]. end of the genome there are two underinflated regions 

Following India, China has the second highest number (UTRS), which play an important role in virus 

of human cases in the world [2]. The causal agent of the replication. 

disease is rabies virus (RV), a member of Lyssavirus So far, vaccination is the most effective method to 

genus, Rhabdoviridae family. The RV genome is almost prevent rabies. The vaccine strains employed in China 

12 kb in size consisting of five monocistronic RNAs, are CTN, aG, PM and PV. The complete genome 

sequence of CTN strain has been sequenced already [4]. 
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the virus strain used for the production of vaccine are 
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still pathogenic for laboratory and wildlife spices, what is 
more important, these rabies vaccines have the potential 
of causing vaccine-induced rabies and appear to have 
limited efficacy in a broad spectrum of animal species 
[5]. Therefore, genetic knowledge of vaccine strain is 
essential. Also, comparison of sequence of vaccine strain 
and field strain circulating in the country would prove 
how effective of the vaccine against the field virus [6]. 

The aG strain, also known as pG strain, was obtained 
from a rabid dog in 1931, Beijing. After 50 passages in 
the brain of rabbit, the strain was named aG strain. 
Then, the aG strain has been adapted to VERO cell cul- 
tures after 68 passages. Much higher antibody was 
obtained in immunized dogs, and the morality was 
lower. Since then, the aG strain has been chosen for the 
production of human rabies vaccine. Before this study, 
P, M, G and L genes of aG strain have been sequenced; 
however, this is the first time that the complete genome 
has been sequenced. In the present paper, the complete 
sequence of the rabies virus vaccine strain aG employed 
in China was sequenced. The sequence obtained indi- 
cates that aG strain genome is similar to the sequence 
of other complete lyssavirus genome available in Gen- 
bank. Sequence comparison with other rabies virus 
strain and phylogenetic analysis according to the N gene 
also falls within the scope of this study. 

2. Materials and methods 

2.1. Virus strain 

The aG strain was prepared and stored by State Key 
Laboratory of Veterinary Etiological Biology, Lanzhou 
Veterinary Research Institite, Chinese Academy of Agri- 
cultural Science (LVRI, CAAS). The parental virus was 
isolated from a rabid dog's brain from Beijing, China, of 
a patient died of rabies from Beijing 1931, China. After 
31 passages in primary hamster kidney (PHKC) cell, it 
was chosen for vaccine production in China [4]. 

2.2 Primer design 

According to the conserved regions of the genome of 
rabies virus published in GenBank, particular the RC- 
HL(Genbank number: AB009663), RV-97(Genbank 
number EF542830), Nishigahara(Genbank number: 
AB044824) and NI-CE(Genbank number: Abl28149), 12 
pairs of primers were designed to amplify the unknown 
regions of aG strain by primer premier 5.0 software 
(Table 1). All primers were synthesized by Shanghai 
Sangon Biological Engineering Technology & Service 
Co., Ltd. 

2.3 Reverse transcription-PCR and direct sequencing 

Total RNA were extracted from lyophilized brains of 
mice infected with aG strain by using an Rneasy Mini 
Plus Total RNA extraction kit (Qiagen) according to the 



Table 1 Primers employed for the amplication of 
unknown genomic RNA of rabies virus aG strain 



n ciy 1 1 ici i la 


Cpri i ipnfpfl'-^') 


1 onnthc 
Lcl iy LI ly 

(bases) 


Base 
position 


F1 a p b 


ACGGTAACAACAAGATCA 


19 


1-1581 


F1R 


CTTCAGCCATCTCAAGAT 


18 




F2F 


TATGTCTCAGTCAGTFCC 


18 


1 340-2933 


F2R 


TCATCGCAAGTGATCTCC 


18 




F3F 


ATACTCTGGGAATCATAGGAT 


21 


2742-4280 


F3R 


TGAGACGTCTGAAACTCACTG 


21 




F4F 


AACATGGGTCGCGATGCA 


18 


4090-5453 


F4R 


GGTCATCATAGACCTCTC 


18 




F5F 


C~ AT/ — P A f~ ATT/ — XT AT A AT 

LA 1 Lj 1 ALA 1 I L 1 LA 1 AA 1 


1 8 


5U63-04 1 U 


F5R 


TA A AT^rfl/TA A A 1 \~t — Ff~ 

1 AAA 1 ALAGGAAAG L 1 L 


1 8 




F6F 


TGTGGAAACTGCGGCTAT 


18 


6290-7430 


F6R 


CGATGAGGTCTGATCTGTCTG 


21 




F7F 


TGTTFGGATFGAAGAGAGTGT 


21 


7344-8513 


F7R 


GCTCACTGAGAAATCGAG 


18 




F8F 


TTCAGAGTTTCGAGAGGCAAT 


21 


8410-9648 


F8R 


ATGTTGACAGGGAAGATGGTT 


21 




F9F 


ATGTTCCAGCCATrGATGCTT 


21 


9272- 
10598 


F9R 


TGAACACAAGCTFGGCATCAG 


21 




F10F 


TGCTCTGCTCAACAGGTT 


18 


10331- 
11932 


F10R 


ACGCTTAACAAATAAACAACA 


21 




F11F 


GGTCTGGCGACACCCCGGTGTFCA 


24 




F11R 


GAGTCTGTCATCTCACTGGATCA 


23 




F12F 


TGCTGCTGCCAAGTAGGAACAT 


22 




F12R 


AGTTTGGCGGCATCCATGCCTG 


22 





a The 12 fragments are labeled from "F1" to "F12" 

b "F" and "R" strand for forward and reverse direction of the primer extension 



manufacture's instruction. Single cDNA was synthesized 
by using First Strand cDNA Synthesis Kit (ToYOBO), 
and the cDNA were amplified using KOD Plus Neo 
(TOYOBO). Amplified DNA products were examined 
via electrophoresis using 0.8% ultrapure TM Agarose 
(Invitrogen) gels. DNA products of expected size were 
purified using Agarose Gel DNA Purification Kit 
(TAKARA). The purified products were added to A at 
the end of 3'end using DNA A-tailing Kit (TAKARA) 
and then ligated with pMD18-T (TAKARA), the ligated 
products were transformed into competent cell JM109 
(TianGen), at least 3 clones carrying the PCR products 
of expected size were identified by PCR and the positive 
clones were sequenced commercially (Shanghai Sangon 
biological Engineering Technology &Services Co., Ltd., 
China). 

2.4 Amplification and sequencing of the terminal ends 

The 5'and 3'terminal of the genome was confirmed 
using the approach described by Kuzmin et al [7]. 
Briefly, Total RNA was subjected to ligation by T4 RNA 
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ligase (Invitrogen) to circularize the genome. Thereafter 
the ligated RNA was subjected to nested PCR with the 
primer marked with F 11F, 11R, 12F and 12 R located 
within the 5'and 3'end of the genome. Products of the 
expected size were dealt with above. 

2.4 Sequence alignment and analysis 

Nucleotide sequences obtained were edited and 
assembled manually using EditSeq of DNAStar program 

7.0 (DNAStar). Phylogenetic analysis of N gene of Lyssa- 
virus, including the construction of phylogenetic tree 
were conducted using Molecular Evolutionary Genetics 
Analysis (MEGA) Version 5.0 with the methods of 
neighbor-joining (NJ) algorithm with the Kimura two- 
parameter model. The reference sequences have been 
listed in Table 2. The reliability of the phylogeny group- 
ings was evaluated using bootstrapping with 1000 repli- 
cates. Bootstrap values of 70% or greater were viewed 
significant. 

3. Results 

3.1 Genome organization of the aG strain 

Using a total of 24 primers (as shown in table 1) the 
strain aG genome was obtained as 12 separate overlap- 
ping PCR products. The 5'- and 3'-terminal were 
obtained as mentioned above. The length of the gen- 
ome is 11925 nucleotides(nt), and the genome organi- 
zation of aG strain, which follows typical rabies virus 
organization, is summarized as follows: a 3' leader 
region of 58 nt(l-58), the N gene (59-1482), P gene 
(1485-2476), M gene (2482-3284), G gene (3290-5356), 
L gene (5381-11855), and the 5'trailer region of 70 nt 
(11855-11925). The coding sequence (CDS) of each of 
the five structural genes are as follows:1353-nt nucleo- 
tide (71-1423), 894-nt phosphoprotein(1515-2408), 
609-nt matrix protein (2497-3105), 1575-nt glycopro- 
tein (3317-4891) and 6384-nt large protein(5411- 
11794). The G-L non-coding region is 455nt. The 
sequences of the transcription initiation sites (TIS) and 
the termination signals of the structural genes and the 
intergenic regions (IGRS) among these genes are 
shown in table 3. 

3.1.1. Nucleotide feathers of strain aG 

The length and sequence of leader region is highly con- 
served in all lyssavirus. The length of the leader region 
is 58nt. The first 12 nucleotides, especially the first 9 
are the same in all the lyssavirus. However, the conser- 
vation is not so stringent after residue 25. Following 
most lyssavirus, the trailer regions of aG strain is 70nts. 

The complementarity of the 3'and 5' ends of the 
genome is another typical feather of the Mononegavir- 
als, with up to 16 nts are complemented. The first 9 nts 
of the genome leader regions are absolutely 



complementary to the corresponding regions of the trai- 
ler regions [7] (Figure 1). 

The transcription initiation signals (TISs) and tran- 
scription termination signals (TTS) are conserved as 
AACAYYHCT and G (A) 7 , but the TTS between G-L 
was AGAAAAAAA at the N-P junction. The intergenic 
regions between the cistrons CT, CCGAA, ACTATT 
and CTGTAGATCGAAAGAGCAACTGGC (Table 3). 
Pairwise comparisons of nucleotide and predicted amino 
acid sequence identities of the aG with the vaccine 
strain of lyssaviruses are listed in Table 4. 

3.2 Structural feathers of the proteins of aG strain 

Consistent with previous studies, at the nucleotide level 
N was the most conserved gene [8]. The N protein of 
aG strain is 450 amino acid residues. Only 10 A i a amino 
acid substitution is unique to the strain aG. Antigenic 
site I (residues 358-367), antigenic site IV (375-383, 
359-366) and RNA-binding domain (residues 298-352) 
were found to be conserved in all the isolates analyzed. 
Ser389, which was considered to be related to casein- 
type phosphorylation site and regulation of viral RNA 
transcription and replication, was highly conserved [9] 
(Figure 2). 

P gene was considered to be the most divergent pro- 
tein among the five structural proteins. Six amino acid 
substitutions, Lys 68 , Tyr 95 , Thr 174 , Thr 184 , Glu 254 and 
Glu 295 , unique to the aG were observed. The interaction 
of the motif [(K/R)XTQT] between residues 145-149 
with cytoplasmic dynein light chain (LC8), which plays 
an important role in viral nucleocapsid axoplasmic 
transport, was encoded as KSTQT [10]. Compared with 
vaccine strain, the N protein binding site (69-177 and 
268-297) of aG strain had one unique amino acid resi- 
dues substitution, Tyr 95 . The L binding site in P, the 
first 19 amino acid residues, was absolutely conserved in 
aG. Ser 162 , Ser 2 io, Ser 271 , which have been shown to be 
involved in the phosphorylation of P were well con- 
served in the aG strain [11] (Figure 3). 

M protein connects the plasma membranes, RNPS and 
G protein together, and is deeply involved in the bud- 
ding of rabies virus. Thr 100 , Ser lu and Ser 174 were the 
three amino acid substitution to the strain aG. The 
motif proline-rich (PPxY) between residues 35-38 is 
thought to be associated with the interaction with WW 
domains of cellular components, and was encoded 
PPEY, which is very conserved [12]. Residue 58, which 
is crucial for regulation of RV RNA synthesis [13] (Fig- 
ure 4), was encoded Glu in aG strain and Gly in PV 
strain. 

The G protein of 514 amino aicd residues is the only 
viral protein exposed on the surface of the virus and 
responsible for the immune responses of the host. Com- 
pared with vaccine strain analysed in this study, there 
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Table 2 Complete genome of rabies virus used in this study from Genbank 



Strain 



Origins 



Country 



Accession 
no 



Length 



CTN181 

Nishigahara 

SAD-B19 

RC-HL 

RV-97 

PV 

ERA 

SRV9 

NI-CE 

HEP-Flury 

DRV 

MRV 

HN10 

SHBRV-18 

BD06 

NNV-RAB-H 

SAG2 

8743THA 

8764THA 

H-08-1320 

RRV ON-99-2 

BR-Pfxl 

9704ARG 

9147FRA 

9001 FRA 

FJ08 

FJ09 

D01 

D02 



Attenuated vaccine strain, derived from the brain tissue of a patient in Shandong, China EF564174 
China 

Seed strain of animal rabies vaccine strain used in Japan Japan AB044824 

A highly attenuated mutant of SAD strain America AB044824 

Animal vaccine strain used in Japan Japan AB009663 

Vaccine strain employed in Russia Russia EF542830 

Pasteur vaccine strain France M13215 

Attenuated rabies vaccine strain America EF206707 

Avirulent vaccine strain China AF499686 

Developed from Nishigahara Japan AB128149 

Derived from a human isolate in Georgia, USA, 1939 America AB085828 

Isolated from a deer in Ji lin, China China DQ875051. 

Isolated from a mouse in China China DQ875050. 

Isolated fron a rabies patient in Hunan, China China EU643590 

Isolated from a silver-haired bat America AY705272 

Isolated in China China EU643590 

Isolated from the brain of human India EF437215 

Vaccine strain Derived from SAD America EF206719 

A street strain Thailand EU293121 

A street strain Thailand EU293111 

Isolated from the brain of human Sri Lanka AB569299 

Isolated from a raccoon Canada EU311738 

Isolated from Dusicyon sp Brazil AB362483 

Isolated from tadarida brasiliensis Argentina EU293116 

Isolated from fox France EU293115 

Isolated from dog Guyana EU293113 

Isolated from dog China FJ866835 

Isolated from dog China FJ866836 

Isolated from the brain of dog China FJ12193 

Isolated form the brain of dog China FJ 1 2 1 94 



1923 



bp 



1926 



bp 



bp 



1926 



bp 



1932 



bp 



1932 



bp 



1931 



bp 



bp 



1926 



bp 



1615 



bp 



1863 



bp 



bp 



1923 



bp 



1923 



bp 



1924 



bp 



1928 



bp 



bp 



1923 



bp 



1925 



bp 



1926 



bp 



1923 



bp 



1924 



bp 



1923 



bp 



1923 



bp 



1922 



bp 



1924 



bp 



1924 



bp 



1925 



bp 



1925 



bp 
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Table 2 Complete genome of rabies virus used in this study from Genbank (Continued) 



MOKOLA 


Isolated from Mokola Virus 


Africa 


NC_006429 


11940 
bp 


KRC5-04 


Isolated from dog 


South 
Korea 


AY730597 


1353 bp 


HNDB28 


Isolated from dog 


China 


EU008922 


1353 bp 


HNDB12 


Isolated from dog 


China 


EU008920 


1353 bp 


GXPL 


Isolated from Canis lupus familiaris 


China 


GQ472474 


1353 bp 


HebeiO(H) 


Isolated from homo sapiens 


China 


EU267777 


1353 bp 


Zhejiang Wz1(H) 


Isolated from homo sapiens 


China 


EU700032 


1353 bp 


West Caucasian bat 


Isolated from West Caucasian bat 


Russia 


EF614258 


12278 


virus 








bp 


Flury-LEP 


Derived from a human isolate in Georgia, USA, 1939 


America 


DQ099524 


11711 

bp 


JS08-45 


Isolated from Chinese ferret badger 


China 


GU647092 


11922 
bp 



are eight amino acid substitutions unique to aG strain: 
His 6 9, Proi84, Pro 25 o» Gly 427 , He 4 3i, He 477 , Lys 4 si, Asn 160 . 
Antigenic site I (residue 231), antigenic site II(residues 
34-42,198-200), antigenic site III (residues 330-338), 
antigenic site IV(residue 264) and antigenic site a (resi- 
due 342) were conserved. Ala 24 2, sp 2 55, He 26g and 
Arg 33 3, which are considered to be associated with viral 
pathogenicity, were replaced with Cys, Met, Thr and Ala 
respectly (Figure 5). 

L protein is an important component of RNP, 
although the N and P protein are also needed to form 
RNP. The 2,127 amino acid residues of L protein are 
distributed into six conserved motifs. 

Three potential N-glycosylation sites, which are 
located at 37-39, 158-160 and 319-321 were conserved 
in aG. The L gene of aG encoded a protein of 2,127- 
amino acid, and thirteen amino acid substitutions 
unique to strain aG were observed: Gly 4 8, Gh/58, Gln 18 9, 
Tyr 353 , Gly 489 , Gly 496 , Ser 606 , Lys 995 , Vali 043 . Serii 33 , 
Tyr 158 8, Leui 658 , Hisisoi- The motifs among 544-563(A), 
728-732(B) and 1705-1710(C) have been regarded as 
functional motifs. These motifs are involved in RNA 
binding, active site of polymerase, polyadenylation or a 
core of ATP binding site. All these amino acid residues 
were conserved in strain aG (Figure 6). In the absence 
of clear functions for these residues, the importance of 
any of these changes is presently unknown. 

3.2 Phylogenetic analysis of N 

As shown by the phylogenetic analysis of the nucleopro- 
tein gene (Figure 7), the three vaccine strain Nishiga- 
hara, NTCE and RC-HL were grouped with aG most 
closely. 

4. Discussion 

In the present study, the entire aG genomic sequence, 
which is composed of 11925 bp, was determined by 12 



paris overlapping fragments covering the whole genome 
to improve our knowledge of the genetic relationship 
between aG strain and other strain of rabies virus avail- 
able in Genbank. In contrast to the previous report [14], 
the length of the whole genomes in GT 1 lyssaviruses 
varies from 11711-nt to 12278-nt, with both even and 
odd numbers of nucleotides being represented [15,8]. In 
contrast to previous studies, the laboratory adapted vac- 
cine strain aG has a odd nucleotide, while other labora- 
tory isolates, such as Nishigahara, PV and SAD-B19 
strain, have genomes of even nucleotides numbers [16]. 
It seems that genomes of odd numbers of nucleotides 
are more common in wild type strains than fixed strain 
but both odd and even numbers of genomes occurred in 
wild type strains and fixed strain. It remains to be deter- 
mined that whether the isolates with odd or even num- 
bers of the genome has any biological significance. 
However, the number of the sequence does not follow 
the 'rule of six' [17]. 

On average, the G+C content of the lyssaviruses gen- 
ome is 44.57%, the G+C content was 45.18 mol% for aG 
strain. This is in agreement with the idea that the G+C 
content of the negative-stranded RNA viruses is lower 
than that of positive stranded RNA viruses, which has 
been considered to be associated with host cell RNA 
editing [18]. In agreement with previous studies, the 
genome of aG strain is similar to that of other lyssa- 
viruses with few variations. The complete nucleotide 
length of aG strain is ll,925nt, which was one, seven 
and three nucleotides shorter than that of Nishigahara, 
PV and SAD-B19 strain. Transcriptional initiation and 
termination signals of aG strain at the beginning and 
end of the each monocistron, including the motif 
AACAYYHCT initiatite transcription and a nine nucleo- 
tide motif WG(A) 7 , except the AGAAAAAAA TTP at 
the N-P junction, which is involved in transcription ter- 
mination and polyadenylation (TTP), are precisely the 



Table 3 Transcription initiation and termination signals for all strain aG 



Gene or region 



Termination signal 



IGRs 



Initiation signal 



3'leader/N 

N/P 

P/M 

M/G 

G/L 



TGAAAAAAA 
TGAAAAAAA 
TGAAAAAAA 
AGAAAAAAA 



CT 

GGAGC 
GTATT 

CTGTAGATCGAAAGAGCAACTGGC 



AAGACCTGT 
AACACCCGT 
AACACCACT 
AACATCCCT 
AACACCTCT 
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5 ' ACGCTTAACAACAAGATC AAAGAAGAAATATACAGCGTCATTTGCAAAG 3 ' 
3'TGCGAATTGTTT ATT TGTTGT T T CTGTTTTCTTTGTT AGTTTGTCGGTC5 ' 

Figure 1 Comparison of the 3' and 5' genomic termini of the antigenomic (+) sense RNA of aG strain 



same as described by Marston. All the intergenic 
sequences (IGS) in the aG agree precisely with what was 
described in that report. Unlike PV and SAD strains, the 
G-L intergenic region of aG strain has only one polyade- 
nylation site. The 3' and 5' of the genome not only play 
an important role in transcription, replication and the 
switching between these two functions, but also in the 
initiation of encapsidation. The 11 nts at the leader and 
trailer region of the genome were complementary con- 
served. These regions showed that the basic structure of 
rabies virus have been retained in strain aG [19]. 

It is reported that the 3'NCR of G mRNA to be the 
pseudogene, because there were two TTP motifs in PV. 
However, in aG strain, there is only one TTP motif, 
which holds true for many other isolates, including 
laboratory fixed strains and street rabies virus strains. 
Thus, a conclusion can be drawn that the psdudogene is 
only part of non-translated region of G mRNA. 



Table 4 Pairwise comparisons of nucleotide and 
predicted amino acid sequence identities of the aG with 
the vaccine strain of lyssaviruses 



Strain 




N (%) 


P (%) 


M (%) 


G (%) 


L (%) 


SRV9 


Nucleotide 


92.5 


89.6 


91.1 


91 


92.2 




Amino acid 


96.2 


88.9 


87.7 


89 


95.3 


CTN181 


Nucleotide 


87.3 


82.9 


83.9 


84.1 


25.1 




Amino acid 


96 


87.2 


93.1 


88.8 


5.9 


ERA 


Nucleotide 


92.6 


89.7 


92.3 


91.4 


92.4 




Amino acid 


96.2 


89.3 


90.6 


89.9 


95.9 


Flury-LEP 


Nucleotide 


92.1 


88.6 


92.6 


90.5 


91 




Amino acid 


96.9 


87.9 


93.6 


91.2 


96.5 


Flury-HEP 


Nucleotide 


92.1 


87.9 


91.5 


89.8 


91.5 




Amino acid 


96.9 


87.2 


92.1 


89.9 


96.4 


NI-CE 


Nucleotide 


97.9 


95.6 


96.9 


97.1 


98.6 




Amino acid 


98.4 


93.3 


96.1 


96.6 


98.6 


Nishigahara 


Nucleotide 


98.1 


96.8 


97.2 


97.1 


98.2 




Amino acid 


99.1 


95 


97 


96.4 


98.7 


PV 


Nucleotide 


92.3 


89.6 


92 


91.6 


92.3 




Amino acid 


95.8 


89.9 


90.6 


90.5 


95.5 


RV97 


Nucleotide 


95.9 


92.5 


92.3 


93 


94.9 




Amino acid 


96 


87.9 


90.6 


89.7 


96.1 


SAD-B19 


Nucleotide 


92.5 


89.6 


91.1 


91.2 


92.2 




Amino acid 


96.2 


88.9 


87.7 


89.7 


95.3 


SHBRV-18 


Nucleotide 


84.2 


79.6 


80.8 


80 


82 




Amino acid 


94.7 


83.9 


88.7 


84.2 


5.8 



It is assumed that the aG strain in China is more clo- 
sely with rabies virus in northern and northeast part of 
China, since these strains are closely related geographi- 
cally and ethnically. The phylogenetic tree based on the 
N gene showed that aG strain clustered with the Japa- 
nese vaccine strain, supporting the idea that rabies virus 
from the same region tend to cluster together. 

Consistent with previous observations that viruses 
from the same geographical area tend to group together, 
the aG strain tends to cluster most closely with DRV, 
which was isolated from deer in Jilin province, China. 
Also, the aG strain grouped closely to other vaccine 
strain, especially the vaccine strain from Japan [20]. 
However, the aG strain did not share high nucleotide 
homology with wild strain, such as FJ008, FJ009 and 
HN10, making aG not the best vaccine in China [21], 
although the strain aG has been employed for human 
vaccine strain in China for a long time. Of course, more 
research is needed to confirm the hypothesis. 

To date, several methods, such as rapid amplification of 
cDNA (RACE) [22-24] and gene-walking approach have 
been employed to obtain the whole genome sequence, 
including the 5' and 3' untranslated regions (UTR). In the 
present study, a simple method of amplification and 
sequencing of full length of rabies virus genome was per- 
formed, according to reference 7. Before this study, simi- 
lar method has been used to sequence the genome of 
different viruses [25]. Compared with the method pre- 
viously, the method described here is easy to perform, 
does not need complex principle and does not need 
expensive device; thus it is recommended for other virus. 

Comparison of nucleotide sequences of rabies virus 
available in Genbank with aG strain showed extensive 
divergence excluding the transcription regulatory signals 
and limited stretches of the nucleotide. However, 
according to the phylogenetic analysis of the N gene of 
aG strain, the strain probably emerged from the DRV, 
or they may share the same ancestor, which was isolated 
deer in Jilin province, China. More researches are 
needed to determine whether the aG strain came origin- 
ally as a result of the DRV adapting to the dog host or 
not. The Ser 389 , the putative casein-type phosphorylation 
site and regulation of viral RNA transcription and repli- 
cation was conserved in strain aG. The antigenic sites in 
strain aG were conserved, indicating the antigenicity of 
aG was well retained, which was suitable for vaccine 
production. 
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Figure 2 Alignment of Antigenic sites I and IV, RNA-binding site of N. Dots represent identity to the majority, shade (with solid black) 
residues differ from consensus. A. Alignment of Antigenic sites I and IV. Dots represent identity to the majority. B. Alignment of RNA-binding site 
of N, shade (with solid black) residues differ from consensus. 
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Figure 3 Alignment of N binding sites (69-177), (268-297)and [(K/R) XTQT], dots represent identity to consensus A. Alignment of N 
binding sites (69-177) and [(K/R) XTQT], dots represent identity to consensus. B. Alignment of N binding site (268-297), dots represent identity to 
consensus. 
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Figure 4 Alignment of proline-rich motif and residue 58. Dots represent identity to the consensus 
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Figure 5 Alignment of antigenic antigenic sitel (residue 231), site II (residues 34-42), (residues 198-200), shade (with solid black) 
residues differ from the consensus. A. Alignment of antigenic site II (residues 34-42), shade (with solid black) residues differ from the 
consensus. B. Alignment of antigenic site I (residue 231) and II (residues 198-200), shade (with solid black) residues differ from consensus. 



P has been considered to be the most divergent of the 
five coding protein. The LC8-interacting motif was con- 
served in aG strain, indicating that the motif serve to 
transport the virus through neurons. Also, the L-binding 
region of P, as well as the N-binding site of P was also 
conserved, except Thr 174 , GIU295. These results, together 
with the five Ser and four Met, indicating that the same 
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Figure 6 Alignment of motif A, B, C. shade (with solid black) 
residues differ from consensus. A Alignment of motif A. shade (with 
solid black) residues differ from consensus. B Alignment motif B. 
shade (with solid black) residues differ from consensus. C. 
Alignment of motif C. shade (with solid black) residues differ from 
consensus. 



function which have been reported in fixed strains were 
also retained in aG strain. Mebatsion and Rasalingam 
questioned the function of the interaction between P 
and LC8 by constructing of recombinant virus in the 
absence of the LC8 binding site. The results indicated 
that the virus lacking the binding site was as pathogenic 
as wild virus, suggesting the interaction is not indispen- 
sible to the spread of RV. 

M protein has been considered to be important in 
membrane budding and interacting with cellular compo- 
nents. Finke found that substitution of Arg 58 to Gly 58 
reduced the expression of full-length RNA. The aG 
strain retains the Glu 58 , suggesting that it may serve to 
RNA regulation. 

The G protein is related to cell attachment, responsible 
for the induction of neutralizing antibodies and cell- 
mediated immune response [26]. Ala242, ASP255, Ile268> and 
Arg 3 33, which are correlated with pathogenicity of rabies 
virus, were encoded Cys, Met, Thr and Ala respectively. 
The antigenic sites were all retained in aG. These results 
suggest the antigenicity of G protein was retained. However, 
the strain aG was strongly attenuated, although other fac- 
tors may be responsible for the pathogenicity of rabies virus. 

The L protein, together with P compose RNA poly- 
merase complex, which is involved in enzymic activities 
of polymerase. The RNA-binding region, active site of 
polymerase, the Pro residues rich motif GXGXG, which 
is involved in polyadenylation or protein kinase activ- 
ities, were conserved; These motifs, together with GHP 
were conserved in strain aG. All These results suggest 
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Figure 7 Phylogenetic analysis of N gene sequence of aG with nucleoprotein sequence available in Genbank by Neighbor-Joining 
program in MEGA 5 software. The number below the brace is bootstrap values for 100 replicates. 



the L protein possess the same function as other strain. Conclusion 

However, little is known about the mechanism of the In this report, we analyzed the full genome of China 

function of L protein, therefore, more researches are human rabies vaccine strain aG. Our studies indicated 

needed and necessary. that the genome of aG retained the basic characteristics 
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of RV, including the organization of the genome, the 
functional domains of the each ORF. At gene level, N 
was the most conserved among the five coding genes, 
indicating We this gene is the most appropriate for 
quantitative genotype definition. The phylogenetic analy- 
sis of the N indicated the aG strain clustered most clo- 
sely with Japanese and Russian rabies vaccine strains, 
suggesting that they may share the same ancestor; also, 
the aG strain did not share high homology with wild 
strains isolated from China, making it may not be the 
best vaccine strain, more research is needed to elucidate 
the genetic relationship among the RV circulating in 
China. 
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